Search Results
Quilt-LLaVA: Visual Instruction Tuning by Extracting Localized Narratives from Histopathology Videos
Visual Instruction Tuning using LLaVA
LLaVA - the first instruction following multi-modal model (paper explained)
Multimodal Foundations and Large Language Models - DLAI8